Vocal Tract Normalization Based on Formant Positions

نویسندگان

  • Nikša Jakovljević
  • Dragiša Mišković
  • Milan Sečujski
  • Darko Pekar
چکیده

This paper presents our initial results in a new approach to vocal tract normalization (VTN). In experiments based on continuous automatic speech recognition (ASR) the VTN procedure is in general carried out in both training and test phase. In the training phase it is used to obtain speaker independent acoustic models of phones. In the test phase it is used to convert input observations into observations nearer to the ones corresponding to the universal speaker. The approach described in this paper is new, because instead of training a single set of acoustic models for the universal speaker, several sets of acoustic phone models corresponding to speakers with similar vocal tract lengths were created. Instead of using the VTN procedure in the test phase, the recognized sequence estimated as the most likely one among sequences based on different acoustic model sets was identified as the final recognition result. Normiranje vokalnega trakta na podlagi lege formantov V prispevku so predstavljeni začetni rezultati novega pristopa k normiranju vokalnega trakta (NVT). V eksperimentih, ki temeljijo na samodejnem razpoznavanju tekočega govora, se postopek NVT izvaja tako v učni kot v testni fazi. V učni fazi se uporablja za pridobivanje akustičnih modelov fonov, ki niso odvisni od govorca. V testni fazi se uporablja za pretvorbo vhodnih opažanj v opažanja, ki so bližja tistim, ki ustrezajo univerzalnemu govorcu. Pristop, ki je opisan v tem prispevku, je nov: namesto da bi učili posamezno množico akustičnih modelov za univerzalnega govorca, je bilo ustvarjenih več množic akustičnih modelov fonov, ki ustrezajo govorcem s podobno dolžino vokalnega trakta. Namesto uporabe postopka NVT v testni fazi je bil končni rezultat razpoznavanja prepoznano zaporedje, ki je bilo ocenjeno kot najbolj verjetno med zaporedji, temelječimi na različnih množicah

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Short-Time Estimation of Vocal Tract Length from Formant Frequencies

Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework ...

متن کامل

On instantaneous vocal tract length estimation from formant frequencies

The length of the vocal tract and its relationship with formant frequencies is examined at fine temporal scales with the goal of providing accurate estimates of vocal tract length from acoustics on a spectrum-by-spectrum basis despite unknown articulatory information. Accurate vocal tract length estimation is motivated by applications to speaker normalization and biometrics. Analyses presented ...

متن کامل

Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images

Three dimensional vocal tract shapes of three males and three females were measured from the magnetic resonance images that were taken during sustained phonation of the five Japanese vowels. Dimensional differences in the vocal tract length of the subjects were quantitatively measured by dividing the entire vocal tract into the oral, the pharyngeal and the laryngeal sections. To investigate mai...

متن کامل

Speaker normalization based on frequency warping

In speech recognition, speaker-dependence of a speech recognition system comes from speaker-dependence of the speech feature, and the variation of vocal tract shape is the major source of inter-speaker variations of the speech feature, though there are some other sources which also contribute. In this paper, we address the approaches of speaker normalization which aim at normalizing speaker's v...

متن کامل

Speaker normalization based on test to reference speaker mapping

The paper presents the speaker normalization technique we implemented in a teaching and training system for hearing handicapped children with the goal to reduce inter-speaker variability in time-frequency speech representation. In an effort to reduce variance caused by variation in vocal tract shape among speakers, a formant based nonlinear frequency warping approach to vocal tract normalizatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006